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1.  INTRODUCTION 


A binary  tree  is  a finite  set  of  nodes,  either  empty  or  containing  one  node 
called  a root,  such  that  all  other  nodes  are  partitioned  into  disjoint  sets  which 
are  respectively  called  left  and  right  subtrees  of  the  root.  The  subtrees  also 
satisfy  the  definition  of  a binary  tree.  Thus,  a binary  tree  is  an  unlabelled 
rooted  arborescence  with  successors  of  at  most  degree  two  distinguished  only  as 
left  and  right. 

Figure  1.1  shows  a binary  tree  with  six  nodes.  The  root  node  is  shown  at 
the  top  and  is  connected  by  arcs  to  two  immediate  successor  nodes  which  are  the 
roots  of  its  left  and  right  subtrees.  Each  node  with  no  successors  (for  in- 
stance, the  left  subtree  of  the  root)  is  called  a leaf.  The  level  of  a node 
Indicates  how  deep  it  is  within  the  tree.  Thus  the  root  has  level  one,  its 
immediate  successor  nodes  have  level  two,  and  so  forth  down  the  occupied  por- 
tions of  the  subtrees.  The  height  of  a binary  tree  is  the  largest  occupied 
level.  A full  binary  tree  has  no  internal  vacancies  (unoccupied  node  positions). 


FIGURE  1.1  - A Binary  Tree  with  Six  Nodes 


Binary  trees  are  frequently  used  as  information  storage  structures  on 
digital  computers.  For  instance,  one  of  vhe  most  popular  methods  of  randomly 
retrieving  information  by  a key,  or  symbol,  is  to  store  the  key  data  in  a 
binary  tree.  To  search  for  a particular  symbol,  we  begin  by  looking  at  the 
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root  and  proceed  by  applying  Che  following  rules  recursively: 

1.  If  Che  symbol  maCches  Che  rooc  symbol,  Che  symbol  Is  found. 

2.  If  Che  symbol  is  "less  Chan"  Che  rooc  (according  Co  some  binary 
ordering  relation)  conCinue  Che  search  by  considering  Che  left 
successor  of  Che  rooc  as  Che  new  root  (of  Che  left  subtree). 

3.  If  Che  symbol  is  greater  Chan  Che  rooc,  conCinue  by  searching  Che 
rlghc  subtree. 

4.  If  there  is  no  rooc,  Che  symbol  is  not  in  Che  binary  tree. 

We  assume  for  simplicity  chat  all  symbols  are  distinct  wich  respect  Co  Che 
ordering  relation.  Otherwise,  Che  ordering  relation  and  search  must  be  modi- 
fied in  an  obvious  fashion.  The  construction  of  a binary  cree  for  use  by  such 
a search  scheme  may  be  performed  by  sequentially  examining  Che  key  symbols  Co 
be  inserted.  This  binary  tree  sore  is  a one  pass  ordering  procedure  which 
proceeds: 

1.  If  there  is  no  root,  insert  Che  symbol  as  Che  root. 

2.  If  the  symbol  is  less  than  Che  root  symbol,  continue  by  considering 
Che  left  subtree. 

3.  If  the  symbol  is  greater  than  the  root  symbol,  continue  with  the 
rlghc  subtree. 

As  an  example,  consider  Che  six  symbols  ABCDEF  and  a lexicographical  binary 
ordering  relation.  Suppose  that  Che  particular  permutation  of  symbols  ex- 
amined is  BDAFCE.  The  resulting  binary  tree  is  shown  in  Figure  1.2,  and  has 
structure  identical  to  the  tree  in  Figure  1.1.  Note  that  this  same  tree  may 
have  resulted  from  other  permutations  of  the  same  symbols,  for  instance 
BADCFE.  Therefore,  there  is  a many-to-one  mapping  of  key  symbol  permutations 
to  corresponding  binary  trees. 

SF 
/ 

E 

FIGURE  1.2  - A Binary  Tree  with  Inserted  Symbols 
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The  height  of  the  binary  tree  in  Figure  1.2  is  four  and  thus  the  maximum 
number  of  comparisons  required  to  insert  another  symbol  is  four.  Similarly, 
if  this  tree  is  used  for  retrieving  symbols,  the  maximum  search  length  for  a 
symbol  in  the  tree  is  four,  and  for  a symbol  not  found  the  maximum  is  five. 

A computer  implementation  of  a binary  tree  storage  structure  requires 
that  each  node  be  represented  by  its  key  symbol  accompanied  by  sufficient 
additional  information  to  identify  and  access  the  left  and  right  subtrees. 

This  is  usually  accomplished  by  use  of  a dense  array  of  rode  symbols  each 
with  left  and  right  pointers,  by  node  storage  via  address  calculation  into  an 
array  space  sufficient  to  store  all  possible  binary  trees  with  a given  number 
of  nodes  and  some  maximum  height,  or  by  some  similar  method. 

In  the  following  sections  we  study  this  widely  used  class  of  binary  trees 
in  order  to  provide  information  useful  in  examining  algorithms  based  on  this 
storage  structure.  A closed  form  counting  formula  for  the  number  of  binary 
trees  with  n nodes  and  height  k is  developed  and  restated  as  a recursion  more 
useful  computationally.  A generating  function  for  the  number  of  nodes  given 
height  is  developed  and  used  to  find  the  asymptotic  distribution  of  binary 
trees.  An  asymptotic  probability  distribution  for  height  given  the  number  of 
nodes  is  derived  based  on  equally  likely  binary  trees.  This  is  compared  with 
a similar  result  for  general  trees. 

Random  binary  trees  (those  resulting  from  the  binary  tree  sorting  algo- 
rithm applied  to  random  strings  of  symbols)  are  counted  in  terms  of  the  mapping 
of  permutations  of  n symbols  to  binary  trees  of  height  k.  An  explicit  formula 
for  this  number  is  given  with  an  equivalent  recursive  definition  for  computa- 
tional use.  A generating  function  is  derived  for  the  number  of  symbols  given 
height.  Lower  and  upper  bounds  on  random  binary  tree  height  are  developed  and 
shown  to  approach  one  another  asymptotically  as  a function  of  n,  providing  a 
limiting  expression  for  the  expected  height. 
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The  random  binary  crees  are  examined  further  to  provide  expressions  for 
the  expectations  of  the  number  of  vacancies  at  each  level,  the  distribution  of 
vacancies  over  all  levels,  the  comparisons  required  for  insertion  of  a new 
random  symbol,  the  fraction  of  nodes  occupied  at  a particular  level,  the  number 
of  leaves,  the  number  of  single  vacancies  at  each  level,  and  the  number  of  twin 
vacancies  at  each  level.  A random  process  is  defined  for  the  number  of  symbols 
required  to  grow  a tree  exceeding  any  given  height. 

Finally,  an  appendix  is  given  with  sample  tabulations  and  figures  of  the 
distributions. 


2.  NUMBER  OF  BINARY  TREES  OF  A GIVEN  HEIGHT. 

In  this  section  we  consider  the  problem  of  finding  the  number  of  binary 
trees  with  n nodes  and  height  R.  Denote  this  number  by  t(n,k)  , where  n 
and  k are  positive  integers.  Since  t(n,k)  - 0 unless 

k < n < 2k  - 1 (2.1) 


we  are  only  concerned  with  integers  n « 1,2,...;  k ■ 1,2,...  satisfying  the 
inequality  (2.1). 

An  explicit  formula  for  the  numbers  t(n,R)  can  be  obtained  by  the  fol- 
lowing simple  combinatorial  argument.  Consider  the  class  of  all  binary  trees 


with  n nodes  and  height  k which  have  exactly  m^  nodes  at  the  level  j+1, 

j"l,...,k-l.  Let  nij  ■ + Tj  , where  and  r^  are  numbers  of  nodes 

which  are  left  successors  and  right  successors  of  nodes  at  the  level  j.  In 

other  words,  t.  is  the  number  of  nodes  at  level  .-j+1  at  the  end  of  left 

^ / m ' 

going  arcs  emanating  from  nodes  at  level  j.  These  can  be  selected  in  [ 


m 


ways,  and  the  r^  nod'  ■>  in 


j-1 
rj  ■ 


ways.  Thus,  the  total  number  of 


ways  to  arrange  the  arcs  between  nodes  at  levels  j and  j+1  is  given  by 
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Since  mg  ■ 1 (the  root)  and  m^  +•••+  m^  ■ n-1  with  mj  >,  1 f°r  J“1 k 

we  obtain  from  (2.2)  the  formula 


t(n,k) 


(2.3) 


where  the  summation  Is  over  all  integers  , j»l, . . . ,k  satisfying 


and 


® j - 1 . j-1, ...,k-l  , 


+• • •+  " n-1  . 


The  formula  is  valid  for  n>l  and  k satisfying  (2.1),  for  n-1  we  have 
trivially  t(l,l)  - 1. 

Although  (2.3)  is  an  explicit  formula  for  the  number  t(n,k)  it  is  not 
very  convenient  for  calculation.  An  alternate  way  is  through  a recurrence. 

Let 

t(n,k)  - T(n,k)  - T(n,k-1),  n >_  1,  k >_  1 (2.4) 

where  T(n,k)  is  the  number  of  binary  trees  with  n nodes  and  height  not 
exceeding  k.  If  we  define 

!1  if  n-0 

, (2.5) 

0 if  n>Q 

and  1(0 ,k)  - 1 for  k _>  0 , we  obtain  the  recurrence  relation 

n 

- ^ T(J,k)T(n-j,k)  , 
j-0 


T(n+l,k+l) 


(2.6) 


valid  for  n>0  and  k>0.  This  follows  from  the  fact  that  the  class  of  all 
binary  trees  with  n+1  nodes  and  height  not  exceeding  k+1  can  be  partitioned 
Into  n+1  subclasses  according  to  the  number  of  nodes  j In  the  left  subtree 
of  root.  Since  the  heights  of  both  the  left  and  right  subtrees  must  not  exceed 
k the  number  of  trees  in  the  j-th  class  Is  the  product  T(j ,k)T(n-j ,k) , and 
(2.6)  follows. 

Note  that  with  the  convention  (2.5)  the  recurrence  (2.6)  yields  automatically 
T(n,k)  - 0 for  n > 2^  - 1 , 


and  that  for  0 <_  n <_  k , T(n,k)  is  just  the  number  of  binary  trees  with 
nodes.  It  is  well  known  (see  [Knuth,  vol.3])  that  the  latter  are  Catalan 
numbers 


C ■ 

n 


n > 0 


n 


(2.7) 


so  that 


T(n,k)  * C for  0 < n < k. 

a — — 


(2.8) 


From  the  recurrence  (2.6)  one  easily  obtains  the  sequence  enumerators 
defined  by 


T(n,k>*“  , k > 0 . 
n>0 


(2.9) 


Since  the  right-hand  side  of  (2.6)  is  the  Cauchy  product,  we  have  immediately 

T(n+l,k+l)xn  - f£(x)  , 

n>0 

from  which  in  view  of  (2.5)  we  obtain 


with 


fk+i(x)  - 1 + x fk(x)  , k >_  0 , 


fQ(x)  - 1 . 


(2.10) 
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Note  chat  f^  (x)  Is  a polynomial  in  x , hence  If  x Is  regarded  as  a complex 
variable  , k«0,l Is  a sequence  of  entire  functions. 

We  now  show  that  this  sequence  converges  uniformly  in  a circular  region 

/ 

of  the  complex  plane,  specifically  that  as  k -►  « 


fk(z)  u(z)  uniformly  for  |z|  <_  -jr  , 


(2.11) 


where 


nm0 


(2.12) 


To  see  this,  note  that  with  C as  in  (2.7)  we  have  T(n,k)  <_  C for  all  k 


which  together  with  (2.8)  yields 


u>(z)  - ffc(z)  - zk  ^ (Cn-T(n, 


k))z 


n-k 


n>k 


so  that  for 


1*1 1 £ 


u(z) 


-Vz)l  i £ CnMn  iE'.*"  • 


n>k 


n>k 


which  is  a tail  of  the  expansion 


2 . 


This  result  will  now  be  used  to  develop  an  asymptotic  distribution  of 
tne  numbers  t(n,k)  as  k -*■  « . In  doing  this,  we  follow  the  method  of 
Renyi  and  Szekeres  used  in  [8]  for  a similar  problem. 

From  (2.4)  we  have  for  k >_  1,  t(0,k)  ■ 0 , 

^Pt(n,k)zQ  - fk(z)  - fk_x(z)  , (2.13) 

n>0 


which  are  entire  functions  for  every  k >_  1.  Hence  by  the  Cauchy  formula 

(z) 


X JT  V”  - W 

(n,k)  2.1  J zn+l 


dz  , 


(2.14) 
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where  we  cake  Che  circle  |z|  - as  Che  conCour  of  incegracion.  To  escimace 
Che  incegral  we  use  Laplace's  mechod  by  firsC  showing  ChaC  as  k -*•  ® Che  only 
significanc  conCribuCion  of  Che  incegrand  is  in  Che  viciniCy  of  posicive  real 


axis . 


F ($,z)  - 1 + -r  e^^z2  , -it  < $ <_  tt 


and  lec  for  $ fixed 


Fk+l(*’z)  " F<*'Fk<^'2))  * k i 1 • 


“ F($,z)  . 


Then 


fk(  i ^ " V*’1)  * k > 1 , 


i.e.  f,  are  iceraces  of  Che  funccion  F 
k 

u(z)  defined  by  (2.12)  sacisfies 


For  z * e*-^  Che  funcCion 


F($,w)  *»  <a  , 


and  is  given  by 


where  Re/ l-eil*>  _>  0 . The  curve  a*1^  , -it  < <p  <_  it  is  cherefore  Che 

curve  of  fixed  poincs  of  Che  funccion  F($,*).  For  any  parcicular  $ Che 
derivacive 


~ F(<p,z)  - F'($,z)  - e^z 


so  chac 


for  z - oj  we  have 


|F' (♦.«)!  - It  eUw|  - |l  - / l-e1*  | . 


The  locus  of  poincs  / 1-e  , -r  < $ <_  it  in  Che  complex  plane  is  an 

Lemniscace  of  iernoulli  wich  polar  eqaacion  p2  * 2 cos  8,  while  1 - / l-e^? 


8 


just  shifts  the  curve  one  unit  to  the  right.  (See  Figure  2.1) 


FIGURE  2.1 


Hence 

|F'(*,u)|  1 1 , 


(2.20) 


with  equality  if  and  only  if  $a0.  Thus,  if  all  fixed  points  (2.18) 

are  atractive  and  since  f^  are  iterates  of  the  function  F converging  to 
u this  implies  that  as  k •*  « 

fk(  \ e4)  - w(  i e1*)  + o(|l  - / l-e^[k  j (2.21) 


uniformly  for  | $ | _>  e > 0 . 


The  modulus  |l  - / 1 -e^  | is  a decreasing  function  of  |<{>|  in 
0 < |o|  <_  ir  with  maximum  1 at  $ * 0.  Further  denoting  u^(<j>)  + iv^($)  - 
■ 1 - /l-e*^  the  real  and  imaginary  parts  have  asymptotic  expansions 


(2.22) 


as  |ij>|  -*■  0 . Hence  for  sufficiently  small  |<£| 
and  thus  if  we  choose  for  instance  |$|  <_ 


we  obtain  from  (2.21) 
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as  k • 


fk(i*1*)*  “(! e1*) + °(e'Xn2k)  • 


(2.23) 


Rsaark?  The  same  result  can  also  be  obtained  more  directly  by  writing 
fk+i(z)  - u,(z)  - 1 + z f£(z)  - (1  - z u>2(z)) 

- z[fk(z)  - a,(z)][fk(z)  + w(z)]. 

Then  choose  e > 0 such  that  |bi  e*^)  j >_  e for  all  <t>  and  choose  K 
such  that  k _>  K=^|fk(-£  e*^)  - u,^  e*^)  | < e . Then  with  z m \ ei^  -or 
k>K 

lfk+1  " o)|  i ^lfk  “ “|(2|u|  + e) 

whence  for  all  m _>  1 

lfk+m  “ U1  - lfK  " “l^l1  “ ^“el*l  + e^m  • 

Hence  (2.23)  follows  by  the  same  argument  as  before. 

1 id, 

If  we  now  substitute  (2.23)  into  (2.14)  we  have  with  z ■ e 

f [*&•*)  -fk-i(i'1*)]‘"ln*d* 

-IT 

(2.24) 

(2.25) 

(2.26) 
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- £ j [*k(i  •**)  - £k-i(i  -1*)]  ^ <♦ ♦ o(.-** ) . 

1*1  < (¥*)* 

as  k • . 

To  estimate  the  remaining  integral  we  first  set  $ - 0 and  call 

\’fk(t)  • 

The  recurrence  (2.10)  yields 

Vi  ■ 1 + X “k  ■ “o  * 1 


from  which  by  substitution  Yk  ■ 2 - we  obtain  the  recurrence 


Tk+i  ■ \ • yo  * 1 • 


(2.27) 


Thus  is  the  k-th  iterate  of  the  function  -£n(x)  ■ x - x1 2  , and  iterates 

of  such  functions  can  be  handled  by  standard  methods  (see  e.g.  [de  Bruijn,  3, 
see.  8.7  or  Exercise  8.11]).  We  obtain  the  asymptotic  expansion 


4 4 &a  k , -2V 

\ ’ k 7-  + 0(k  ) , or 


k2 


„ 4 , 4 In  k , -2S 

2 - — + + 0(k  ) as  k -*■  ® . 

* k2 


(2.28) 


To  obtain  the  expansion  for  |$j  < we  assume  that  it  has  the  form 


In  k 


-2, 


fk(l  e±*)  - 2 * E *l<Sk>  + “ s2C{k>  + 0(ir‘>  • 


where 


sk  « m2  • 


For  $ fixed  we  then  have  + k ^k  SO  C^at 

1 r 

/■\  u\  snv<;  - 

f 


1 
k 

/ 1 eiA  _ 2 _ glU  +*  0 + ^(k+1) 

' 4 ' k + 1 (k+1) 


k+1 


2 g2(£  + k 0 + 0(k”2)  * 


(2.29) 


where  £ ■ 5k  • Using  the  expansion 

8-^(5  + £ 5)  ■ g1(S)  + k 8^(5)  + • • • , 


g2(C  + ^ 0 " 82(5)  +l  g2(C)  + *“  * 


1 _ 1 1 

k+1  “ k l+k 


-1 


I.  + 

k k2  k3 


Ju_  _ 1_  1 

.2  . 2 


(k+1)  k (1+k  )z  k 


-TTi"  7T  " 3 + *•' 


and 


£n(k+l)  « £n  k + - •••  , we  obtain  after  collecting  terms 


11 


Vi(i  *u)  ■ 2 - e *i<» + *2(l> 

+ 1 IgjU)  - Cg'«)]  - — [2g,(0  - Cg2<5>]  + 0(k‘2)  . 
k2  k3 

On  the  other  hand  from  (2.10)  ffc+i ( e^)  * 31  + \ (■£  » 


whence  using  (2.29)  and  e*^  ■ a'1*'  ■ l + 


we  have 


(2.30) 


- / 1 i*\  . 1 ...  . la  k 

fk+l  \4  * ) " 2 ~ k 81(5)  , 2 8 


2 *2 


(O 


+ -,  [ t g?(0  + 52]  - - gi(Og2(C)  + 0(k“2) 

k2  4 1 k 2 1 z 


(2.31) 


Comparing  terms  of  (2.30)  and  (2.31)  we  obtain 

g]U)  - C»i(0  - -J  g|(0  + ?2  , 

2gj(«)  - - I g1C5)g2<«  . 

with  initial  conditions  g^(0)  ■ g£(0)  ■ 4 from  (2.28).  These  equations 
have  a solution 


gjU)  - 25  cot  - C , g2(5)  » 52(sin  \ f-)-2 


which,  when  substituted  back  into  (2.29),  gives 


Next 


fk  (l  e^)"  2 " £ C cot  ± 5 + C2(sin  \ O"2  + 0(k’2) , 

k (2.32) 

where  5*  - id>k2  , |$|  < (~“) 

fk(i  *U)  ~ fk-l(i  •**)  " k=I  5k-l  cot  7 5k-l 


’ t «k  cot  7 5k  + «£  (sin  7 5k  )_2 

la  (k-1)  P2  . . 1 _ .-2  . n/,  -3. 

“ ? 1 2 ^k-1^  0(k  ) , 

(k-1)2  * 1 1 k 1 
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1 *c-l  £ 

and  setting  ,8°  that  k-1  * k 


, and  using 


the  expansions 


cot  y (£  - £ 0 - cot  -j  5 + 2iT  5(sin  \ 0~2 


E 1 1 2 

— - cot  ^ £ (sin  j O + • • • , 

4k2  2 2 


£n  (k-1)  * in  k - -r  + • • < 


and 


we  obtain 


(sin  \ U - £ 5))  - (sin  O (1+^5  cot  yi+- 


), 


£n  k _3  _ 1 , , , 1 „v -2  , £_  , . 1 . -2 

3—  5 cot  y C (sin  -j  4)  + -*7  (sin  y 5) 

k k 


4k 


cot  ~ t,  (sin  f O fc  + 0(k"3) 


1 ->-2 
2 


However,  for  Z2  ■ i$k2  , |$|  < , we  have 


5 cot  y 5 

so  that  we  can  write 


0(£n2k)  , sin  ■?■  <; 


1 - 1 ”2 

2 


0 as  k-«°  , 


f*(i  °1*)  - ‘k-i(i'L*)  -h  — fr:  + °(tt)  • <2-M> 

k sin  j ( v 


The  same  expansion  can  also  br:  obtained  by  again  substituting  (2.32)  into 

the  recurrence  (2.10).  If  we  now  insert  (2.33)  into  the  integral  in  (2.24) 

and  use  the  substitution  £2  ■ i$k2  , (2.24)  becomes 

n .2 


t(n,k) 


JL 

2iri 


sin2 


+ 


(2.34) 
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where  Che  path  of  integration  f a a determined  from  the  condition 


is  (see  Figure  2.2): 


T("l  + i) 

for 

— £n2k  < t < 0 

/2 

T(1  + i) 

for 

0 < t < -i— 

n 


The  integrand  is  a meromorphic  function  with  poles  at 


5 “ 2ma  , m«*±l,±2,..., 

m 

and  corresponding  residues 
rog  - 4[: 


(2.35) 


|3(27rm)2  - 2 ^ (2ffm)4J 


- \ (2  mn)  2 
k 


(2.36) 


Before  applying  the  residue  theorem,  we  have  to  close  the  path  of  integra- 
tion, for  instance  by  the  arc  (see  Figure  2.2) 


. -10 
5 ■ pk  e • °k 


Zxl'V.  , - -T  < 9 < -T 


4 - - 4 * 

Deforming  the  arc  in  the  neighborhood  of  the  real  axis  so  that  it  passes 

i 1 2 2 2 

approximately  in  between  the  two  poles  we  have  from  | sin  z|  ■ sin  x + sinh  y 
| sin  jCj"  1 + n , n > 0 


along  the  arc  for  large  enough  k . Hence 
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/ 


- — £2 

— . “2  d5 


'A  Sl“2  I 5 


< (l+n)o, 


tr/2 

i 


n 2 

- — p.  sin  6 

k2  k 


de 


ir/2 


< (l+n)p. 


£ 


2n  O2  A 

“ ~2  Pk  9 
k 


de 


(l+n)ir  p£  k2 

2n 


n 2 

,.2  pk 


1-e 


2 2 

(l+n)tr  p k n 8 

<_ — p.  ■ 0(£n  k)  as  k -*■  <* 

2n  k2 


(2.37) 


Finally,  applying  che  residue  theorem  to  the  integral  over  the  contour  r + A 
and  using  (2.34),  (2.36)  and  (2.37)  we  have  the  asymptotic  expansion 


t(n,k)  ■ 4 


n+1 


l"  S[2k2 

m>l  L 


(21m)1*  - 3(2:rm)2  a 


- ^r(2im)2 


+ * 4”o(^)  ♦ «■«(&£)  • 


(2.38) 


From  here  we  can  get  the  distribution  of  heights  of  binary  trees  with  n 
or  equivalently  the  probability  that  a randomly  selected  binary  tree  has  height 
k if  all  binary  trees  with  n vertices  are  considered  equally  likely. 

Calling  this  quantity  ?n(k)  we  have 

Pq0o  . ^ (2 . 39) 
n 


where  t(n,k)  is  the  Catalan  number  (2.7).  Using  the  asymptotic 


expansion  [6]  C 


n 3/2  jz 
n vn 


+ 0 ( 4nn  we  obtain  for  large  k and  n. 
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Pn00 


• /T  *2E  j^28(2irm) 4 - 3(27rm)2  J, 
m>l 


-(2Tnn)28 


(2.40) 


+ 0(V/2  -/“3/2  ^ 


3/2 


)+o(a^)+  o(^), 


where  8 * — *■  . Note  Chat  if  k < n , which  is  the  case  of  interest,  the 

k 

dominant  O-term  is  the  last  one.  Thus,  if  for  some  arbitrarily  small  5 > 0 


3/8  + 5 , . 

n <_  k £ n , 


(2.41) 


then  as  n ■+•  « 


0O 

PQ(k)  ~ vT  02  12  [28(2ira)4  - 3(2rm)2]  e ‘(2inn)  8 . (2.42) 


m-1 


To  verify  that  (2.42)  is  Indeed  a probability  distribution  note  that  as 
n -*■  - 

m OB 

2. 


£\n(k)  - af\  /V  ^ [2SC2Tnn)4  - 3(2™) 2 ] e "(27nn) 

k J m-1 


8 


dk  , 


where  the  limits  of  summation  and  integration  are  given  by  (2.41).  Making  a 
substitution 

/T 

we  have  as  n -*■  - 


r,  Yj  io-)"  / 283/2  e dS 


m-1 


/ 


OB 

7 


- (2rm)2  / 361/2  e '(27ra)  S d8 
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But 


00 


e-(2ina)2S 


dg 


2g3/2  e-(2m)  S 


00 

/ 


+ (2jna)2  / 233/2  e'(2inn)  3 d8  ’ 


and  we  are  lefc  with 


lim 

e-*o+ 


4^  £ 2 (2™) 2 e -(2’n")  3 


0-1 


90 

- -*-/ 

a-  J 


, -U2 

^ e du  ■ 1 


by  the  substitution 


u2  ■ (2iTm)  3 . 


If  we  denote  .X  as  the  random  variable  with  distribution  p (k)  and  call  the 
n n 

asymptotic  distribution  function 


F(x)  - 


lim  P 
n-*« 


(2.43) 


we  obtain  by  integrating  the  right-hand  side  of  (2.42)  over 
by  the  same  method  as  above 

F(x)  - 4x'3  ,5/2 

0*1 


0 < k < 2x  ^ 


(2.44) 


It  can  easily  be  seen  from  (2.43)  that  for  x - o(Vn)  F(x)  ■*  0 and 
thus  the  asymptotic  distribution  (2.42)  or  equivalently  (2.43),  (2.44)  is 
valid  for  all  k in  the  range  (2.1)  as  n ■*  ».  (See  Appendix,  Figure  A.l). 
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Tala  is  identical  witu  the  distribution  function  obtained  by  Renyi  and 
Szekeres  (see  [8]  p.  506)  for  the  height  of  general  (as  opposed  to  binary) 
trees.  Only  tne  normalizing  factors  in  (3.43)  differs  by  a constant,  namely 
v2  . More  precisely,  if  .)£  * is  the  height  of  a general  tree  with  n nodes 
(i.e.  with  no  restriction  on  the  number  of  successors  of  a node)  then  for 
large  n 

X, ~ ^2  X*  , (2.45) 

n n 

a somewhat  surprising  result. 


3.  BINARY  TREES  GENERATED  BY  RANDOM  PERMUTATIONS . 

As  described  in  the  introduction  the  tree  insertion  algorithm  defines  a 

map  which  assigns  to  every  permutation  tt(1 n)  a binary  tree  with  n 

nodes.  If  all  permutations  of  integers  l,...,n  are  considered  equally 
likely  the  resulting  trees  are  referred  to  as  random.  Thus,  every  numerical 
quantity  defined  on  a binary  tree  becomes  a random  variable. 

Let  be  the  height  of  a random  binary  tree  with  n nodes,  i.e., 

generated  by  random  permutations  of  the  first  n integers.  Clearly,  the 
probability 

P(H  < k)  - K B(n,k)  , (3.1) 

n — n. 

where  B(n,k)  is  the  number  of  permutations  ir(l,...,n)  mapped  into  trees 
with  height  not  exceeding  k . (See  Appendix.,  Table  A. 2)  . 

In  order  to  obtain  an  explicit  expression  for  (3.1)  we  first  need  a 
suitable  indexing  system  for  the  nodes,  i.  natural  way  to  do  this  is  to  con- 
sider first  a full  binary  tree  and  label  its  nodes  by  the  sequence  of 
positive  integers  starting  from  the  root  and  labelling  in  each  subsequent  level 
from  left  to  right.  (See  Figure  3.1). 
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» 


level  1 
level  2 
level  3 
• • « 

FIGURE  3.1 

Thus  nodes  at  level  j , j » 1,2,...,  from  left  to  right  have  labels 

2j_1  + m , a - 0.....2-3"1  . (3.2) 


1 


Note  that  the  left  and  rignt  successor  of  a node  labelled  x have  labels 
2x  and  2x  + 1 respectively,  left  successors  always  have  even  labels,  right 
successors  have  odd  labels  ( the  root  being  an  exception) . 

Next  consider  a fixed  binary  tree  with  a nodes  and  for  every  label  x 
of  the  fora  (3.2)  define 

( 0 if  there  is  no  node  with  label  x , 
d(x)  - j (3.3) 

( 1 + d(2x)  + d(2x  + 1)  if  there  is  a node  with  label  x . 

Note  that  d(x)  is  simply  tne  number  of  nodes  in  a subtree  with  root  at  x , 
in  particular  d (1)  * n and  d(x)  * 1 if  and  only  if  the  node  labelled  x 
has  no  successor,  i.e.,  is  a leaf. 

Lemma:  There  is  a one-to-one  correspondence  between  binary  trees 

with  n nodes  and  height  not  exceeding  k and  the  set  D(n,i) 
of  vectors 

(d ( 1) ,d(2) .... ,d(2k  - 1)) 

with  non-negative  integral  components  satisfying  the  conditions: 
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(1)  d(l)  - n , 

(2)  For  all  j ■ 1, ...  ,1c 

2J-1  <_  x < 2^-1  =*-  0 <.  d(x)  < 2k_j+1  - 1. 

(3)  d(2x)  + d(2xfl)  > 0 =>  d(x)  - 1 + d(2x)  + d(2x+l)  . 


Proof:  Given  a binary  tree  the  numbers  d(x)  are  uniquely 

decined  by  (3.3).  Property  (1)  is  obvious,  property  (3)  follows  from 
(3.3)  since  nodes  2x  and  2x  + 1 are  successors  of  node  x , hence 
if  there  is  a node  with  labels  either  2x  or  2x  + 1 there  must  be  one 

with  label  x . Property  (2)  is  necessary  since  if  it  were  violated 
then  there  would  be  a subtree  with  root  at  level  j having  at  least 
2^~^+^  nodes  »nd  the  height  of  the  tree  would  then  exceed  k . Con- 
versely, given  a vector  (d(l) , . . . ,d(2^-l) ) satisfying  (1)  - (3), 
construct  first  a complete  binary  tree  of  height  k and  then  eliminate 
all  nodes  with  labels  x such  that  d(x)  » 0 . Properties  (1)  - (3) 
then  guarantee  that  the  result  is  a binary  tree  with  exactly  n nodes. 

A vector  (d(l) , . . . ,d(2  -1))  e D(n,k)  can  be  used  to  compute  the  number 
of  permutations  mapped  into  the  tree  corresponding  to  this  vector.  Consider 
a permutation  7r(l,...,n)  ■ (s^,...,sq)  mapped  into  a binary  tree  with  height 
not  exceeding  k and  look  at  the  node  into  which  a particular  symbol,  say 
s^  , is  mapped.  Let  x^  be  the  label  of  this  node  and  let  L(s^)  and  R(s^) 
be  the  ordered  subsets  of  (s^,...,sq),  which  are  mapped  into  the  left  and 
right  subtrees  of  the  node  x^  . By  the  very  nature  of  the  mapping  if  the 
elements  of  L(s^)  U R(s^)  are  reshuffled  the  tree  is  not  changed  as  long  as 
the  order  of  elements  within  each  subset  L(s^)  and  R(s^)  of  L(s  i)  U R(si)  is 
preserved.  But  the  numbers  of  elements  in  L(s?  ) and  R(s^)  are  d(2x^) 
and  d(2x^  + 1)  respectively  so  that  the  nurber  of  permutations  resulting 
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from  such  a reshuffle  Is 


U(2Xi)  + d(2Xi  + 1)]!  [d(x±)  - 1]! 

d<2X±) : d(2Xl  + 1)!  " d(2X±) ! d(2x±  + 1)!  * 

using  (3.3)  and  d(X^)  > 0 (since  s^  Is  mapped  into  a node  labelled  X^) . 
Repeating  this  argument  for  each  node  of  the  tree  corresponding  to 
(d(l)  , . . . ,d(2  -1))  gives  the  total  number  of  permutations 


fl  [d(X)  - 1] ! 

11  d(2x)  ; d(2X  + 1) ! * 


where  the  product  is  over  all  d(X)  >0,  X • 1 2 

using  the  lemma  and  (3.1)  we  obtain  the  formula 


k-1 


1.  From  here 


P(H  < k)  - -^r  y ' n — t-d  W— 1.  13  • 

( Q - } ^defek)  d^>:  d^!  * 


(3.4) 


Note  that  if  d'  • (d ' (1) , . . . ,d'  (2k  - 1))  is  obtained  from  d - (d(l) , . . . ,d(2k-l)) 


by 


d'00 


d(X)  if  d(x)  > 0 
1 if  d(X)  - 0 


(3.5) 


the  formula  (3.4)  takes  on  a simpler  form 


P (H  < k) 
n— 


-1 


d’(x) 


(3.6) 


Unfortunately,  eXcept  in  a few  special  cases,  the  set  D(n,k)  is  quite 
complicated  for  (3.6)  to  be  useful  for  computation.  Again  a recurrence  re- 
lation may  be  preferable. 

Indeed,  such  a recurrence  is  quite  easy  to  derive.  Consider  the  left 
and  right  subtree  of  the  root  of  a random  binary  tree  with  n nodes.  If 
J is  a random  number  of  nodes  in  the  left  subtree  then  we  have 


Vl  ' 1 + <'j  • Hn-J>  ' 


(3.?) 
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Now  J is  simply  Che  number  of  symbols  in  a random  permutation 

ir(l, . . . ,n+l)  ■ (s^, . . . ,sq+^) , which  are  less  than  s^  . Therefore,  J is 

uniformly  distributed  over  {0,...,n},  and  H.  and  H _ are  conditionally 

J n-j 

independent  given  J . Consequently 


P(Hq+1  - l<k|J-J)-  P(«j  i k>p(Hn.j  - k)  » 

from  which  by  calling 


F(n,k)  - P (Ha  < k) 


and  taking  the  expectation  we  obtain  the  recurrence 

n 


valid  for 


F(n+l,k+l)  - ^ F(j ,k)F(n-j  ,k) 

j-0 

n i 0 , k >_  0 if  we  define 


(3.8) 


(3.9) 


F(0,k)  - 


1 if  k - 0 , 


0 if  k > 0 . 


It  may  be  noted  that  (3.9)  gives 


F(n,k)  ■ 1 for  0 < n < k , and 


F(n,k)  ■ 0 for  n > 2* 


as  expected. 


} 


(3.10) 


Now  (3.9)  can  be  used  to  calculate  the  numbers  F(n,R)  for  moderate  values 
of  k and  a , (see  Appendix,  Tables  A. 2 and  A. 3,  and  [2]),  however  the  memory 
requirements  increase  rapidly. 

It  can  also  be  used  together  with  (3.10)  for  some  special  choice  of  n 

It 

and  k . For  instance  for  n close  to  2 -1  we  obtain 

, k > 1 , (3.11a) 


F(2k-i,k)  - n l—L.) 

j-1  \ 2^-1  / 


F(2k-2,k)  - <2k-l)F(2k-l,k)  , k 1 1 , 


F (2k-3,k)  - (2k— 1)  (2k”1-l)F(2k-l,k)  , k > 2 , 


(3.11b) 

(3.11c) 
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and  a few  more.  Note  that  (3.11a)  is  the  probability  of  obtaining  a full 
tree  of  height  k . However,  these  few  terms  are  of  minor  interest. 

Another  possible  approach  is  to  use  generating  functions.  Defining 


00 

fk(x)  - ^^F(k,n)xn  , k >_  0 , 


n*0 


we  have  Immediately  from  (3.9)  the  relation 

x 


fk+l(x)  * 1 


+ / f*2< 


(y,dy  , f q (x)  * 1 


(3.12) 


(3.13) 


Note  chat  fk(x)  are  again  polynomials,  fk(0)*l  , and  that  for  all  x e (0,1] 
fk(x)  < fk+1(x)  -*>  (1+x)-1  as  k—  . 


It  is  hoped  that  (3.13)  can  be  used  to  obtain  an  asymptotic  distribution  of 
the  heights  Hq  . However,  we  have  not  been  successful  in  that  respect 
to  date. 

It  has  been  suggested  to  us  by  A.  Washburn  that  a lower  bound  on  the 
expected  height  E{Hq)  can  be  easily  obtained  from  (3.7).  Taking  expecta- 
tion we  have 

E{Hn+i}  ■ 1 + S{max(Hj  , Ha_j)  > (3.14) 

and  conditioning  upon  J 

E{max(Hj,  Ha_j)|  J«j  } >_max(E{Hj),  E{Hn_j})  . 

Since  J is  uniformly  distributed  over  {0,...,n}  this  implies 

n 

E{max(HJf  Hn_j)}  > ^3-  ^ max(E{Hj}  , EtH^})  . (3.15) 

J-0 

Hence  if  a , n * 0,1,...,  is  a sequence  of  numbers  defined  recursively  by 
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(3.16) 


l V' 

+ L max(aj’Vj)  • ao " 0 * 


0+1 


j-o 


It  Is  easily  seen  from  (3.14)  and  (3.15)  that 


E(H  } > a for  all  n ■ 0,1,... 
n — u 


Note  that  aQ  Is  a strictly  Increasing  sequence  so  that 


n 

T>(VVj)  ■ 

j-o 


a.  for  n odd  , 


n/2< j<a 


\/2  + 2 


Oj  for  n even. 

n/2<j<n 


Since  clearly  aQ  ■ 0(n)  we  have  as  n • 


a — 1 + 
n 


n/2<j<n 


which  upon  approximating  the  sum  by  an  integral  yields 

1 

<x(t)  '**1  + 2 / a(ty)dy  . 

*1/2 


On  the  other  hcnd,  let  Y be  a random  variable  defined  by 


Then 


1 

r j 

if 

ht 

> H T > 

nY  - 

l 

| n-J 

if 

J 

HJ 

— n-J  * 

< H T . 
n-J 

nY  < max{J,n-J}  , 


(3.17) 


(3.18) 


(3.19) 
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and  by  (3.7) 


E{Hi*l}  " 1 + E{HnY}  * (3,20) 

wow  for  large  n , Y Is  uniformly  distributed  over  ( , 1) , and  since 

U(n)  - E{Hn> 


is  an  increasing  function  of  n we  get  from  (3.20)  by  conditioning  on 
applying  (3.19)  the  asymptotic  inequality 


u(n)  <1+2 


u(ny)dy 


Y 


and 


(3.21) 


valid  as  n ■+■  » . But  this  together  with  (3.17)  and  (3.18)  indicates  that 
we  should  have  in  fact 


u(n)  — a(n)  as  n -*•  » 


(3.22) 


where  a(t)  is  a solution  of  (3.18).  It  is  easily  verified  that  (3.18)  has 
a solution 

/.v  Za  t 
a(t)  ‘ 'PETT  + 0 • 

where  a is  an  arbitrary  constant.  But  as  u(n)  -*■  » the  constant  can  simply 
be  disregarded  and  we  have  an  asymptotic  equivalence 

li(n)  ~ (1  - £n  2)"1  £n  a » 3.25889  Zn  n (3.23) 


as  n -*■  » . (See  Appendix,  Tables  A. 2 and  A. 3). 
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4.  FURTHER  RESULTS  FOR  RANDOM  BINARY  TREES. 

We  now  turn  our  attention  to  some  other  quantifier  associated  with  random 
binary  trees.  In  particular  we  wish  to  study  the  process  of  growing  the  tree 
as  the  number  of  nodes  increases.  A convenient  way  to  visualize  such  a process 
Is  to  consider  a sequence  X^^,...  of  independent  random  variables  with  a 
common  continuous  distribution  and  take  as  our  permutation  the  ranks  of 
X^,...,Xq  for  each  n - 1,2,...  . Clearly,  for  each  fixed  n all  resulting 
permutations  are  equally  likely,  and  each  new  symbol,  i.e.,  rank  of  X , , , is 
uniformly  distributed  over  {l,...,n+l}. 

If  for  some  n we  have  a random  binary  tree  then  a new  symbol  will  be 
mapped  into  a new  node.  This  new  node  can  appear  in  one  of  n+1  possible 
locations  in  the  current  tree.  Following  Knuth  [6]  we  will  refer  to  these 
locations  as  vacancies  of  the  current  tree.  They  are  depicted  in  Figure  4.1 
as  empty  squares. 


FIGURE  4.1 


Let  for  n > 1 , j > 2 , V . be  the  number  of  vacancies  at  level  j at 

n • j 

time  n , i.e..  in  a random  binary  tree  with  n nodes.  Clearly  Vq  ^ - 0 
for  j > a+i  and 


v - + 
n,2 


V , +• 
n,3 


•+  V 


a, n+1 


■n  + 1 


(4.1) 
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Also,  obviously 


Vl,2-2  • 

Suppose  now  that  at  time  n the  vacancies  have  values  v v . 

rr  n,n+l 

Since  a new  node  is  equally  likely  to  fill  any  of  the  :urrent  n+1  vacancies, 
the  probability  that  It  will  fill  a vacancy  at  level  j equals 


-S al_ 
n + 1 


But  then 


while 


Vn+l,j  " Va,j  " 1 ’ 

Vn+l,j+l  " Vn,j+1  + 2 * 

Vn+1 , 1 " vn,i  for  a11  ^ ^ +1  • 


Thus  If  we  define  a sequence  of  random  polynomials 

Ha«  ‘ E Vn,j  *J'1  - "i1  • 

Ji2 

we  see  that  these  polynomials  form  a Markov  chain  with  transition  probabilities 

Wn+l(x)  " wn(x)  + xj_1(2x'1)  I Vx>  * wn(x))  * T+^T  ’ 
n >_  1,  j ■ 2,...,n  + 1 , and  the  initial  state 

W^(x)  * 2x  . In  particular  we  then  have 


E{  Vi(lt)  I Vx)}  • Vx>  + 

■ .-ii- 

EK<x>}-(1  + ¥i)(1  + rE1i 

- It  2x(2x+1)  •••  (2x+n-l)  - ^r 
n.  n. 


n+2 

y xj_i(2x-i) 

j-2 


p»j 

n + 1 


Consequently 


n+1 

E 


j-2 


(2x) 


j-1 


(4.2) 
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where  £ j are  Stirling  numbers  of  the  first  kind  in  Knuch's  definition  and 
notation  [6].  Thus 


E 


j • 2, . . . ,n  + 1,  n>l, 


(4.3) 


gives  us  the  expectation  of  the  number  of  vacancies  at  various  levels.  (See  Appendix, 
Table  A. 5).  Other  quantities  of  some  interest  may  be  obtained  from  here. 

For  instance  the  expected  distribution  of  the  number  of  vacancies  over  levels 
is  Immediate 


j - 2, . . . ,n  + 1 ; a > 1 . 

Next  let  Kq  be  the  number  of  comparisons  needed  to  insert  a new  (n+l)st 
symbol  into  a binary  tree  by  the  algorithm  described  in  the  Introduction. 
Then  the  probability 

P(Kq  - k)  - e{p(Kq  - x|WQ(x))|-  E - vn0c  + 1)  , 

ie*l, . . . ,a  , n ^ 1 , 

since  the  number  of  comparisons  is  k if  and  only  if  the  symbol  fills  a 
vacancy  at  level  <+1  . Recalling  that  the  Stirling  number  is  also  the 

number  of  permutations  of  n symbols  with  exactly  < cycles  we  can  write 

p(Kn  • *>  ■ S+l  ’nW  - 

where  q^ic)  i3  the  probability  that  a random  permutation  of  n symbols  has 
k cycles.  (One  wonders  whether  the  number  of  comparisons  can  be  related  to 
uhe  number  of  cycles.)  (See  Appendix,  Table  A. 7). 
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With 


P(Kn  - *>  ' 75ij 


rC]  • 


ic  ® 1, . . . ,n  , 


the  generating  function  (by  definition  of  the  Stirling  number)  is 

n 

< n 2x  + <-l 


0n(x) 


Ztj/1  \ K n 2x  + <- 

p(kn  ’ ■ n TTT 

k-1  ‘■1 


Hence  k is  a convolution,  k 
n * n 

pendent  Bernoulli  variates: 


1 

0 


Consequently 


u„  ■ E(1v 


x,  + •••  + x , where  the  x 's  are  inde- 
1 n ’ k 


•K  K+l 
K-1 

n 

P 


■ 2(I+j 


<■1 


♦ wi) 


2<Hn+l  - » • 


with  Hn  representing  the  Harmonic  number  [6]  (See  also  the  Riemann  Zeta 

function  in  Abramowitz  and  Stegun  [1].)* 

n n n 


a ■ Var 
n 


fkn>-  2 


p q - 2 

K K 


<■1 


^ <+l  (<+1)2 
K*1  <“1 


2«n+l  - » - 4<i!a+i  ' » ‘ 2 “n+1  * 4 Hn«  + 2 ' 


Since  asymptotically  [5] 


\ ■ * + *»a  + ir  + °(h)’ 

n 

(2)  -^2  i / 1 \ 

“n  -r-sr  + 0(-)  • 


with  y Euler's  Constant,  0.5772..., 
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we  obtain  for  Che  number  of  comparisons 


V ~2{y-l  + £n(n+l) } - 2 £n(ofl)  - 0.8456  •••  , 

a 

2 2W2 

a ~2{y+1  ~ ■=?-  + £n(n+l) } - 2 £n(n+l)  - 3.4253  •••  . 

u 0 


Since  <_  1 and  oq  -*•  <■  , the  central  limit  theorem  applies, 


yielding 


k - u 

PI  -2- “ < x)  - *(x) 


f 

fa  J 


x 


dy  . 


Also,  by  the  Berry-Esseen  Theorem  (c£.  Feller  [4],  p.  544) 


sup  P 


(Vs)- 


$(x) 


< 6 


where 


Cn  ‘ H Eix«  ' "*|3  “ S PA  (P«  + "0 


K-l 


•C-1 


Asymptotically , 


r ■ 

n 


-2  (h(4)  - l)  + 4 (h<3>  - l)  - 3 (V2j  - l)  + H - 1 
\ n+1  / \ n+1  / \ n+1  / n+1 


TT 
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+ 4(0.20205  •••)  - y- 


+ y + £n(n+l)  + 1 


* £n(n+l)  - 4.714  •••  , 

so  that  the  bound 

1 

^o(n+l) 

which  gives  a fairly  high  rate  of  convergence  to  normality  for  the  distribu- 
tion of  the  number  of  comparisons  to  insert  a new  symbol.  The  results  given 
can  be  used  to  analyze  a complete  binary  insertion  sort  algorithm  operating 
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on  a random  string  of  symbols.  Higher  order  expansions  are  also  possible  for 
describing  chis  process. 


Further,  let  M be  the  number  of  nodes  at  level  j in  a random  binary 
ft*  J 

tree  with  n nodes.  Since  clearly 


M . . . + V . .. 

a.j+1  ft.J+1 


2Mq  j , J " 1* • • • ,n  , 


with 


M . - 1 , . . *0  , n > 1 , whence 

Q)X  n i tt  r x 


Mn  « X ^ ^ ( T ) ^n  r+1  ’ or  uPon  ta^ing  the  expectation 

r-j 


E{M 


,*  - # E [:] 

r-j  u J 


, j— 1, ... ,ft  , n > 1 


(4.5) 


(See  Appendix,  Table  A. 4). 
Note  that 


/ 1 

n 

1 M 1 

l l v rni 

KJ-i  n*J| 

n:  ^ w 

r-j 

is  an  expected  fraction  of  nodes  occupied  at  level  j . It  may  be  called  an 
expected  relative  thickness  of  the  tree  at  level  j . Looking  again  at  Figure 
4.1  we  see  that  we  can  distinguish  between  two  kinds  of  vacancies,  those  which 
are  attached  to  a leaf  (and  hence  come  in  pairs)  and  those  attached  to  an  in- 
ternal node  of  the  tree.  We  will  refer  to  these  two  kinds  as  twin  and  single 
vacancies  respectively. 

Let  for  n >_  1,  j ^2,  be  the  number  of  single  vacancies  and 

n * j ft , j 

the  number  of  twin  vacancies  at  level  j in  a random  binary  tree  with  n nodes. 
Clearly 


+ V 


(2) 

n»  j 


* 


- 0 


2 . 
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If  at  time  n Che  values  of  Chese  variables  are 


(2}  (2) 

vj  ; .....  v:-:.,  and  v_  ; ,...,  vv 


,a>  v(« 

'o,2  »•*•»  vn,n+l 


n,2 


n,n+l  ’ 


Chen  filling  a single  vacancy  ac  level  j resulcs  in 


V<i>  , - v(l>  - 1 . V«>  +2  . 

Q+l.j  n,j  afl,j+l  n,  j 


wich  probabilicy 


n + 1 


in 


,(!) 

n+l,j 


(1)  . . 
v ' , + 1 
n»j 


wich  probabilicy 


,<2> 

“.4 


while  filling  a twin  vacancy  at  level  j resulcs 


2<2>  . - v<2>  - 2 , VC2) 

n+l,J  u.j 


(2) 

n+l,j+l  " vn, j+1 


+ 2 , 


n + 1 

If  we  again  introduce  Che  random  polynomials 


The  remaining  vacancy  numbers  are  not  changed  . 


W(1)  » y V(1}  x*  1 , W(2)  - y V(2j  xj_1  , 
“ La  “.J  a La  n,j 

it2  . it2 


we  obtain  for  their  expected  values  Che  equations 


Since 


E W£l  <X> 


ElWn1>(x) 


+ ^rE k2)(x)  * 


wnil(x)  - E k1)(x)  + 


^-E  Wn2)(x) 


n > 1 , with  E j (x)  | - 0 , E j W^2)  (x)  J - 2x  . 
wjw^(x)  | + E j W^2)  (x)  | 


E j Wq(x)|  we  obtain  by  substitution 


E »“>  (x) 


n-1  _ 
n+1  ^ 


W*1* (x) 

n 


+ 5TE  Vx>  * 


from  which  by  using  (4.2)  we  have 
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j * 2,..., a ; a > 1 , while  Che  corresponding  expression  for  cwin  vacancies  is 
obtained  by  subtracting  (4.7)  from  (4.3)  with  (4.7)  set  equal  to  zero  for 
j ■ n+1  . (See  Appendix,  Table  A. 6). 

We  conclude  this  section  by  computing  the  expected  number  of  leaves  in  a 
random  binary  tree  with  n nodes.  From  the  first  equality  in  (4.6)  the  ex- 
pected number  of  all  single  vacancies  equals 

n-1  n-1 

'dpiT  E tE  k<l)  Er(r+1>  1 

r»l  r»l 

since  by  (4.2)  E (Wq(1)  }■  n+1  . The  latter  sum  equals  y (n+1)  so  that  the 
expected  number  of  all  twin  vacancies  is  n+1  - y (n+1)  . The  expected  number 
of  leaves  is  clearly  half  the  number  of  twin  vacancies,  that  is  y (n+1)  . 
Thus,  in  a random  binary  tree  on  the  average  about  y of  the  nodes  are  leaves. 


E W^U) 


i 
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Remark:  Having  In  mind  Che  process  of  growing  random  binary  trees  as 


described  above  we  can  also  look  at  a random  process 


Nj.  ■ min  | a : Hq  > k j , Nq  ■ 1 , (4.8) 

l.e.  the  time  (•  number  of  symbols)  needed  to  grow  a tree  over  the  height 
k.  (See  Appendix,  Table  A. 8).  From  (4.8)  clearly 

> n)  - P(Hq  < k)  - F(n,k) 

so  that  g^(x)  ■ 1 - (l-x)f^(x)  is  tne  ordinary  probability  generating  function 

gk(x)  - ^ PO^  - n)xQ 
n>l 


of  the  random  variable  . Denoting 


il(m,k)  * E 


W15  *“  (Nk  " ■ + i) 


the  m-*th  factorial  moment  of  and  using  the  fact  that 

.(m)  .(m-l) 

5(m,k)  - — g^l)  - m ^ fk(l) 

dx  dx 


we  obtain  by  applying  Leibnitz  formula  to  (3.13)  the  relation  ' 

m 

u(nrt-l,k+l)  - j+l)  y(j+ltk)u(m-*j+l,k)  , 

J-0 

m > 0 , k > 0 . In  particular  with  m"0  this  becomes 

E i\+l(Nk+l  ' l)[  " 2(S(Jik  } ^ * k 1 0 • 


or 


Var  (\+1  > ' 2®  l\  i>2  + * < Vl  >[l  - E <\+l  >_  ' 


Unfortunately,  it  is  the  first  moment  E {N^  },  which  is  hard  to  obtain  for 
large  k . (See  Appendix,  Table  A.  7). 
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Symbols 
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TABLE  A. 1 - Binary  Trees  with  n Nodes  and  Height  k,  t(n,k)  (Eq.  2.3) 


Symbols 
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TABLE  A. 2 - Partitions  of  Permutations  of  n Symbols  into  Binary  Trees  of  Height  K 


Minimum  Mean  Variance 
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TA£LE  A. 3 - Random  Binary  Tree  Height  (Adapted  from  Eqs.  3.9,  3.23) 


Symbols 


1 


I 
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>xpected  Nodes  by  Level  in  Random  Binary  Trees  (Eq 
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.xpected  \hcancles  by  Level  in  Random  Binary  Trees  (Eq.  4.3) 
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TABLE  A. 6 - Expected  Single  Vacancies  by  Level  in  Random  Binary  Trees  (Eq.  4.7) 
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Number  of  Symbols 
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Height  Exceeding  k (Eq.  4.8) 
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